Emotional Sentence Annotation Helps Predict Fiction Genre
نویسندگان
چکیده
Fiction, a prime form of entertainment, has evolved into multiple genres which one can broadly attribute to different forms of stories. In this paper, we examine the hypothesis that works of fiction can be characterised by the emotions they portray. To investigate this hypothesis, we use the work of fictions in the Project Gutenberg and we attribute basic emotional content to each individual sentence using Ekman's model. A time-smoothed version of the emotional content for each basic emotion is used to train extremely randomized trees. We show through 10-fold Cross-Validation that the emotional content of each work of fiction can help identify each genre with significantly higher probability than random. We also show that the most important differentiator between genre novels is fear.
منابع مشابه
Similarity Based Genre Identification for POS Tagging & Dependency Parsing Experts
POS tagging and dependency parsing achieve good results for homogeneous datasets. However, these tasks are much more difficult on heterogeneous datasets. In (Mukherjee et al., 2016, 2017), we address this issue by creating genre experts for both POS tagging and parsing. We use topic modeling to automatically separate training and test data into genres and to create annotation experts per genre ...
متن کاملDifferent Flavors of GUM: Evaluating Genre and Sentence Type Effects on Multilayer Corpus Annotation Quality
Genre and domain are well known covariates of both manual and automatic annotation quality. Comparatively less is known about the effect of sentence types, such as imperatives, questions or fragments, and how they interact with text type effects. Using mixed effects models, we evaluate the relative influence of genre and sentence types on automatic and manual annotation quality for three relate...
متن کاملAWATIF: A Multi-Genre Corpus for Modern Standard Arabic Subjectivity and Sentiment Analysis
We present AWATIF, a multi-genre corpus of Modern Standard Arabic (MSA) labeled for subjectivity and sentiment analysis (SSA) at the sentence level. The corpus is labeled using both regular as well as crowd sourcing methods under three different conditions with two types of annotation guidelines. We describe the sub-corpora constituting the corpus and provide examples from the various SSA categ...
متن کاملA Comparison of Relationship between Text and Picture in the Selected Iranian and Contemporary American-European Illustrated-Fiction Books Based on the Theory of Maria Nikolajeva and Carole Scott
Illustrated-fiction books are special forms of art that are the combination of text and picture. The relationship between text and picture in this genre is diverse and variegated, and has different effects on the audience; however, little research has been done about it. The goal of this research is to compare text/picture relationship in the selected Iranian and contemporary American-European ...
متن کامل“But Is It Science Fiction?”: Science Fiction and a Theory of Genre
If science fiction is a genre, then attempts to think about the nature of science fiction will be affected by one’s understanding of what genres are. I shall examine two approaches to genre, one (genres as regions of conceptual space) dominant but inadequate, the other (genres as historical particulars) better, but only occasionally making itself seen. I shall then discuss several important, in...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
عنوان ژورنال:
دوره 10 شماره
صفحات -
تاریخ انتشار 2015